Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

Authors

  • Nick Johnston
  • Damien Vincent
  • David Minnen
  • Michele Covell
  • Saurabh Singh
  • Troy T. Chinen
  • Sung Jin Hwang
  • Joel Shor
  • George Toderici
Abstract

We propose a method for lossy image compression based on recurrent, convolutional neural networks that outperforms BPG (4:2:0), WebP, JPEG2000, and JPEG as measured by MS-SSIM. We introduce three improvements over previous research that lead to this state-of-the-art result. First, we show that training with a pixel-wise loss weighted by SSIM increases reconstruction quality according to several metrics. Second, we modify the recurrent architecture to improve spatial diffusion, which allows the network to more effectively capture and propagate image information through the network’s hidden state. Finally, in addition to lossless entropy coding, we use a spatially adaptive bit allocation algorithm to more efficiently use the limited number of bits to encode visually complex image regions. We evaluate our method on the Kodak and Tecnick image sets and compare against standard codecs as well as recently published methods based on deep neural networks.
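The first improvement, a pixel-wise loss weighted by SSIM, can be illustrated with a small sketch. The abstract does not give the exact formulation, so everything below is an assumption made for illustration: the names `patch_ssim` and `ssim_weighted_l1`, the non-overlapping 8x8 patches, the L1 pixel error, and the weight `(1 - SSIM) / 2` are hypothetical choices, not the authors' method. The idea shown is only that regions the reconstruction already matches structurally contribute little to the loss, while structurally dissimilar regions contribute more.

```python
# Illustrative sketch only: patch size, L1 error, and the (1 - SSIM) / 2
# weighting are assumptions, not details taken from the paper.

C1 = (0.01 * 255) ** 2  # standard SSIM stabilizing constants for 8-bit data
C2 = (0.03 * 255) ** 2


def _mean(values):
    values = list(values)
    return sum(values) / len(values)


def patch_ssim(x, y):
    """SSIM between two equally sized flat lists of pixel values."""
    mx, my = _mean(x), _mean(y)
    vx = _mean((a - mx) * (a - mx) for a in x)
    vy = _mean((b - my) * (b - my) for b in y)
    cov = _mean((a - mx) * (b - my) for a, b in zip(x, y))
    numerator = (2 * mx * my + C1) * (2 * cov + C2)
    denominator = (mx * mx + my * my + C1) * (vx + vy + C2)
    return numerator / denominator


def ssim_weighted_l1(orig, recon, patch=8):
    """Mean absolute error with each patch weighted by (1 - SSIM) / 2."""
    height, width = len(orig), len(orig[0])
    total, count = 0.0, 0
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            rows = range(top, min(top + patch, height))
            cols = range(left, min(left + patch, width))
            x = [orig[r][c] for r in rows for c in cols]
            y = [recon[r][c] for r in rows for c in cols]
            # Weight is 0 for a structurally identical patch, up to 1 otherwise.
            weight = (1.0 - patch_ssim(x, y)) / 2.0
            total += weight * sum(abs(a - b) for a, b in zip(x, y))
            count += len(x)
    return total / count


if __name__ == "__main__":
    image = [[(r * 16 + c) % 256 for c in range(16)] for r in range(16)]
    inverted = [[255 - v for v in row] for row in image]
    print(ssim_weighted_l1(image, image))
    print(ssim_weighted_l1(image, inverted))
```

In a training setup the same weighting would be applied to a differentiable tensor loss; the pure-Python version here only demonstrates the arithmetic on small grayscale grids.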

Similar resources

Improved Adaptive Block Truncation Coding for Image Compression

The Block Truncation Coding (BTC) is one of the lossy image compression algorithms. In this paper, we have proposed a method called the Improved Adaptive Block Truncation Coding (IABTC) based on Adaptive Block Truncation Coding (ABTC). The feature of inter-pixel redundancy is exploited to reduce the bit-rate further by retaining the quality of the reconstructed images. The proposed method outpe...

Region of interest coding in volumetric images with shape-adaptive wavelet transform

We evaluated some arbitrary-shape ROI (Region of Interest) coding techniques for three-dimensional volumetric images, and through the observation we propose a flexible ROI coding with efficient compression performance. In arbitrary-shape ROI coding, the object in an image is coded with higher fidelity than the rest of the image, together with the shape information, which indicates the region of...

(AIC) is a still image compression system which combines the intra frame block prediction from H.264 with a JPEG-based discrete cosine transform followed by context adaptive binary arithmetic

JPEG is a popular DCT-based still image compression standard, which has played an important role in image storage and transmission since its development. Advanced Image Coding (AIC) is a still image compression system which combines the intra frame block prediction from H.264 with a JPEG-based discrete cosine transform followed by context adaptive binary arithmetic coding (CABAC). It performs m...

Orientation adaptive subband coding of images

In the subband coding of images, directionality of image features has thus far been exploited very little. The proposed subband coding scheme utilizes orientation of local image features to avoid the highly objectionable Gibbs-like phenomena observed at reconstructed image edges with conventional subband schemes at low bit rates. At comparable bit rates, the subjective image quality obtained by...

Embedded to Lossless Image Coding (ELIC)

In this paper, a novel approach for Embedded to Lossless Image Coding (ELIC) is presented. With embedded to lossless coding, lossy encodings of the image at all lower bit rates are embedded in the lossless bit stream. This algorithm uses wavelet transforms that map integers to integers with embedded coding. The coding is based on bit significance criteria in which coefficients that become signi...

Journal:
  • CoRR

Volume: abs/1703.10114  Issue:

Pages: -

Publication date: 2017